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VIDEO CODING METHOD AND DECODING 
METHOD AND DEVICES THEREOF 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

This invention can be used in low bit race video coding for 
teie-communicacive applications. It improves the temporal 
frame rate of the decoder output as '-veil as che overall picrure 
quality. r C 

2. Related an of the Invention 

In a typical hybrid transform coding algorithm such as the 
ITU-T Recommendation H.261 [1| and MPEG [2i motion 
compensation is used to reduce the amount of cempcrai 
redundancy in die sequence. In the H.261 coding scheme. ii 
the frames are "coded using only forward prediction, here- 
after referred to as P-frames. In the MPEG coding scheme, 
some frames are coded using bi-direction prediction, here- 
after referred to as B-frames. B-fraraes improve the effi- 
ciency of the coding scheme. No w the [11 is rTU-T Rec- :c 
ommendntion H.261 (Formerly CGTT Recommendation 
H.261; Codes for audiovisual services at p:<64 kbU/s 
Geneva. 1990 . and the [2| is BO/EEC 11172-2 1993 . 
Information technology — Coding of moving pictures and 
". v associated audio for digital storage media at up to about U 25 
Mbic/s - Part 2: Video. 

However, it introduces delay in the encoding and 
_ decoding, making ic unsuitable for applications in the com- 

S municarive services where delay is an important parameter. 

M= FIG. la and lb illustrates the frame prediction of H.261 and 

MPEG as described above. A new method of coding involv- 
*Z ing the coding of the P and B frames as a single unit, 

^ ; hereafter referred to as the PB-frame. was introduced. In this 

Nil scheme the blocks in che PB-frames are coded and trans- 
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mitted together thus reducing the total delay. In fact the total 
delay should not be more than a scheme using forward 
prediction only but at half the frame rate. 

FIG. 2a shows che PB-frame prediction. A PB-frame 
consists of two pictures being coded as one unit. The name 
PB comes from the name of picture types in MPEG where 
there arc P-frames and B-frames. Thus a PB-frame consists 
of one P- frame which is predicted from che last decoded 
P-frame and one B- frame which is predicted both from che 
last decoded P-frame and che P-frame currendy being 
decoded. This last picture is called B-frame because parts of 
it may be bi-directionally predicted from the past and future 
P-frame. 

FIG. 2£ shows the forward and bi-directional prediction 
for a block in che B-frame. hereafter referred co as a 3-block. 
Oulv the resion that overlaps with the corresponding block 
in the current P-frame. hereafter referred co as :hc P-bicck. 
is bi-directionally predicted The rest of the B-block is 
forward predicted from the previous frame. Thus only the 
previous frame is required in the frame store. The informa- 
tion from the P-frame is obtained from the P-block currently 
being decoded. 

Ln the PB- block only the motion vectors for the P-block 
is transmitted to the decoder. Trie torxixd and bacscvord 
motion vectors for che 3-block is derived from the P motion 
vectors. A linear motion model is used and che temporal 
reference of the 3 and ? frame is used to scale Ac motion 
vector appropriately. FIG. depicts the motion vector 
scaling and -he formula is shown below. 
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where 

NfV is the raotioo vector of the P-block. 

MV^. and MV fl arc the forward and backward motion 

vectors for the B-block. 
TR^ is the increment in the temporal reference from the 

last P- frame co the current B-frame. and 

TR P is the increment in the temporal reference from the 
last P-frame to the current P-frame. 

io Currcndy the method used in the prior an assumes a linear 
motion modeL However this assumption is not valid in a 
normal scene where the motion is typically not linear. This 
is especially true when the camera shakes and when objects 
are noc moving at constant velocities. 

15 A second problem involves the quantization and trans- 
mission of the residual of the prediction error in the B-block 
Currently the coefficients from the P-block and the B-block 
are interleaved in some scanning order which requires the 
B-block efficients to be transmitted even when they are all 

20 zero. This is not very efficient as it is quite often that there 
are no residual coefficients to transmit (all coefficients are 
zero). 

SUMMARY OF THE LVVENTION 

25 In order to solve the first problem, the current invention 
employs a delta motion vector to compensate for the non- 
linear motion. Thus it becomes necessary for the encoder to 
perform an additional motion search to obtain the optimum 
delta motion vector that when added to the derived motion 

30 vectors would result in the best match in the prediction This 
delta motion vectors are transmitted to the decoder at the 
block level only when necessary. A flag is used to indicate 
to the decoder if there arc delta motion vectors present for 
the B-block. 

35 For the second problem, this invention also uses a flag to 
indicate if there are coefficients for the B-block to be 
decoded. 

The operation of the Invention is described as follows. 
40 FIG. 3a shows the linear motion model used for the 
derivation of the forward and backward motioa vectors from 
the P-block motion vector and the temporal reference infor- 
mation As illustrated in FIG. 3b. this model breaks down 
when the motion is aot linear. The derived forward and 
45 backward motion vector is different from the actual motioa 
vector wheo the motion is aot linear. This is especially true 
when objects in the scene arc moving at changing velocities. 

In the current invenrioa the problem is solved by adding 
a small delta motion vector to the derived motioa vecror to 
50 compensate for the difference between the derived and true 
motion vector. Therefore the equatioas in f I) and (2) are qow 
replaced by equations (3) aad respectively. 

where 

MV is the motion vector of the P-block. 
M v oww i* wc ^lta motion vector. 
NfVy and NfV' a ' arc the new forward and backward 
motion vectors for the B-block according to the current 
invention. 

TR 3 is the increment in the temporal reference from the 

. last P-frame to the current B-frame. and 
TRp is the increment in the temporal reference from the 
last P- frame to the current P-frame. 
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Note: Equations (3) and (4) are used for the motion vector 
in the horizontal as well as the vertical directions. Thus 
the motion vectors are in pairs and there are actually two 
independent delta motion vectors, one each for the hori- 
zontal and vertical directions. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. la is a prior art which illustrates the prediction mode 
used in the ITU-T Recommendation IL261 Standard. 

FIG. 16 is a prior an which illustrates the prediction mode 
used In the ISO-EEC/JTC MPEG Standard. 

FIG. 2a illustrates the PB-frame prediction mode. 

FIG. 2b illustrates the B-block bi-directional prediction 
mode. 

FIG. 3a illustrates the Linear motion model. 

FIG. 3b illustrates the non-linear motion model of the 
current invention 

FIG. 4 illustrates the encoder functionality block diagram. 

FIG. 5 illustrates the B-block bi-directional prediction 
functionality block diagram. 

FIG. 6 illustrates the decoder runctionaitiry block dia- 
gram. 

PREFERRED EMBODIMENTS 

The preferred embodiment of the current invention is 
described here. FIG. 4 illustrates the encoding functionality 
diagram. The present invention deals with the method for 
deriving the motion vectors for the B-block. The encoding 
functionality is presented here for completeness of the 
embodiment. 

The encoding functionality block diagram depicts an 
encoder using a motion estimation and compensation for 
reducing the temporal redundancy in the sequence to be 
coded. The input sequences is organized into a first frame 
and pairs of subsequent frames. The first frame, hereafter 
referred to as the I- frame, is coded independent of all other 
frames. The pairs of subsequent frames, hereafter xefened to 
as PB-frame, consist of a B- frame followed by a P-frame. 
The P- frame is forward predicted based on the previously 
reconstructed I- frame or P-frame and the B- frame is 
bi-directionally predicted based on the previously recon- 
structed I- frame or P-frame and the information in the 
current P-framc. 

The input frame image sequence. 1. is placed in the Frame 
Memory 2. If the frame is classified as an I- frame or a 
P-frame it is passed through line 14 to the Reference 
Memory 3. for use as the reference frame in the motion 
estimation of the next PB- frame to be predictively encoded. 
The signal is then passed through line 13 to the Block 
Sampling module 4. where it is partitioned into spatially 
Qoa-ovcrlapping blocks of pixel data for further processing. 

If the frame is classified as an I-frarae. the sampled blocks 
are passed through line 16 co the DCT module 7. If the frame 
is classified as a PB-frame. the sampled blocks arc passed 
through line 17 to the Motion Estimation module 5. The 
Motion Estimation module 5 uses Lniormation from the 
Reference Frame Memory 3 and the current block 17 to e 
obtain the motion vector for that provides the best match for 
the P-biock The motion vector and the local reconstructed 
frame. 12. are passed through line 19 and 20. respectively, 
to the Motion Compensation module 6. The difference 
image is formed by subtracting the motion compensated o 
decoded frame. 21. from the current P-block. 15. This signal 
is then passed through line 22 to the DCT module 7. 
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Iq the DCT module 7. each block is transformed into the 
DCT domain coefficients. The transform coefficients are 
passed through line 23 to Quantization module 3. where they 
are quantized. The quantized coefficents are then passed 
5 through line 24 to the Run-length 3c Variable Length Coding 
module 9. Here the coefficients are entropy coded to form 
the Output Bit Stream. 25. 

If the current block is an I-block or a P-block. the 
quantized coefficients are also passed through line 26 to the 
W Inverse Quantization module 10. The output of the Inverse 
Quantization 10. is then passed through line 27 to the 
Inverse DCT module 11. If the current block is an [-block 
then the reconstructed block is placed, via line 28. in the 
13 Local Decoded Frame Memory 12. If the current block is a 
P-block then the output of the Inverse DCT 29 is added to 
the motion compensated output 21. to from the recon- 
structed block 30. The reconstructed block 30. is then placed 
in the Local Decoded Frame Memory 12. for the motion 
20 compensation of the subsequent frames. 

After the P-biock have been locally reconstructed, the 
information is passed again to the Motion Compensation 
Module 6. where the prediction of the B-block is formed. 
a FIG. 5 shows a more detailed functional diagram for the 
p B-block prediction process. The P-motion vector derived in 

yg the Motion Estimation module 51. is passed through line 57 

^ to the Motion Vector Scaling Module 53. Here the forward 

and backward motion vectors of the B-block is derived using 
*B ■" 30 the formula (I) and (2). respectively. In the present 

M 1 ■ . embodiment, an additional motion search around these 

5S vectors is performed in the Delta Motion Search module 54. 

IJfj to obtain the delta motion vector. In this embodiment the 

fcy ^ motion vector is obtained by performing the search for all 

delta motion vector values between -3 and 3. The delta 
motion vector value that gives the best prediction in terms of 
the smallest mean absolute difference in the pixel values of 
the B-block and the prediction block is chosen. The predic- 
if^ 40 tion is formed in the Bi-directional Motion Compensation 

SQ module 55. according to FIG. 2b using the information from 

Q the Local Decoded Frame Memory 52, and the Current 

q Reconstructed P-block 50. In the bi-directional prediction. 

~~ only information available in the corresponding P-biock is 

43 used to predict the B-block. The average of the P-block 
information and the information from the Local Decoded 
Frame is used to predict the B-block. The rest of the B-block 
is predicted using information from the Local Decoded 
50 Frame only. 

The prediction difference block is then passed through 
line 22 to the DCT module 7. The DCT coefficients arc then 
passed through line 23 co the Quantization module S. The 
result of the Quantization module 8. is passed through line 

55 24 to the Run-length & Variable Length Coding 9. In this 
module the presence of the delta motioa vector and the 
quantized residua] error in the Ourput Bitsrxeam 25. is 
indicated a variable leagth code. NOB which is the acronym 
for No B-block This dag is generated in Run-leagth &. 

60 Variable Length Coding module 9 based on whether there 
are residual error in the Quantization module 3 and delta 
motion vectors found in the Delta Motion Search module 54 
is not zero. Table' I provides the preferred embodiment of the 
variable tength code for the NOB flag. The variable length 

65 code of the NOB dag is inserted in the Output Bitsoream. 25. 
prior to the delta motion vector and quantized residual error 
codes. 
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n/ariable length code for the NOB riag't 

Quantized Residual Delta Motion 



NOB Ecor Coded Vectors Coded 



0 No No 

10 No Yes 

UO Yes No 

Ul Yes Yes 10 



FIG. 6 shows the functional block diagram for the 
decoder The Input Bit Stream. 31. is passed to the Variable 
Length & Run Length Decoding module 32. The block and ^ 
side information are extracted in this module. If che frame is 
a PB-frame then the bitstrenm is checked if any delta morion 
vector and/or quantized residual error coefficients present. 
The output of the module 32is passed through line 37 to the 
Inverse Quantisation module 33. The output of the Inverse 
Quantization 33. is then passed through line 38 to the 
Inverse DCT module 34. Here the coefficients are trans- 
formed back into the pixel values. 

If the current frame is an I-frame then the output of 
Inverse DCT 34. is passed through line 39 and stored in the ^ 
/ Frame Memory 42. 

If the current frame is a PB-frame. the side information 
containing the motion vectors are passed through line 45 to 
the Motion compensation module 36. The motion Compen- 
sation module 36. uses this information and the information ^ 
in the Local Decoded Memory. 35. to from the motion . 
compensated signal. 44. This signal is then added to the 
output of the Inverse DCT module 34. to form the recon- 
struction of the P-block. 

The Morion Compensation module 36. then uses the 3J 
additional information obtained in the reconstructed P-block 
to obtain the bi-directional prediction for the B-biock The 
B-block is then reconstructed and placed in the Frame 
Memory. 42. together with che P-block. 

By implementing this invention, the temporal frame rate 40 
of the decoded sequences can be effectively doubled at a 
fraction of the expected cost in bit rate. The delay is similar 
to that of the same sequence decoded at half the frame rate. 

As described above in the present invention a new pre- 
dictive coding is used to increase the temporal frame rate 45 
and coding efficiency without introducing excessive delay. 
Currently the motion vector for the blocks in the 
bi-directionally predicted frame is derived from the morion 
vector of the corresponding block in the forward predicted 
frame using a Linear motion model. This however is not 50 
effective when the motion in the image sequence is not 
linear. According to this invention, the efficiency of this 
method can be further improved if a non-linear morion 
model is used- In this model a delta morion vector is added 
to or subtracted from the derived forward and backward 55 
motion vector, respectively. The encoder performs an addi- 
tional search to determine if there is a need for the delta 
motion vector. The presence of this delta morion vector us 
the transmitted bitstream is signalled to the decoder which 
then takes the appropriate action to make use ot the delta 60 
motion vector to derive the effective forward and back-ward 
motion vectors for the bi-directionally predicted block 
What is claimed: 

L A method for encoding a sequence of video image 
frames comprising the steps of: 65 
dividing a source sequence into a set of group of pictures, 
each group of pictures comprising a first frame 
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(I-frame) followed by a plurality of pauTof predic- 
tively encoded frames (PB-frame pairs), each PB-frame 
pair having a corresponding P-btock; 
dividing each I-frame or PB-frame pair into a plurality of 

spatially non-overlapping blocks of pixel data: 
encoding the blocks from the I-frame (I-Wocks; indepen- 
dently from any other frames in the group of pictures; 
predictively encoding the blocks from the second ^frame of 
the PB-frame pair (P-blocks). based on the I-blocks in 
the previous I-frame or the P-blocks in the previous 
PB-frame pair, 
bi-directionally predictively encoding the blocks from the 
first frame of the PB-frame pair (B-blocks). based on 
the I-blocks in the previous I-frame or the P-blocks in 
the previous PB-frame pair and the correspoading 
P-block in the current PB-fraroe pair, 
deriving a scaled forward motion vector and a scaled 
• backward motion vector for the B-block by scaling fce 
m odon vector of the corresponding P-block in the 
current PB-frame pair, 
obtaining a final forward motion vector for the Mtockby 
adding a delta motion vector to the scaled forward 
motion vector and 
* obtaining a final backward motion vector for the B-block 
by subtracting the delta motion vector from the scaled 
backward motion vector. 

2. A method for encoding a sequence of video image 
frames according to claim 1. wherein 

the scaling of the motion vector is based on a temporal 
reference of the first and second frames of the 
PB-frame pair. ; . - . _ ' 

3. A method for encoding a sequence of video usage 
frames according to claim 1. further comprising the step of 
EE ; an encoded output, wherein the encoded output is a 
bitstream comprising: 

temporal reference information for the fine and second 
frames of the PB-frame pairs: 
« motion vector information for the P-blocks: 

quantized residual error information for the P-blocks: 
delta motion vector information for the B-blocks: and 
quantized residual error information for the B-Woc*- 
2 A method for encoding a sequence of video image 
43 frames according to claim 3. wherein 

Ae output bitstream contains additional information to 

indicate the presence of at least one of: 
mtf delta motion vector information for the B-blocks: and 
50 the quantized residual error information for thcB-bto 
5. A method for decoding a sequence or video image 
frames comprising the steps of: 
decoding the compressed video image sequence as a sec of 
group of pictures, each group of pictures comprising an 
» ^frame foLlowed by a plurality of PB-trarae pairs, each 
PB-frame pair having a corresponding P-block: 
decoding each I-frame or PB-frame pair into a plurality or 

spatiallv non-overlapping blocks ot pixel data: 
decoding 'the I-blocks from the I-frame independently 
60 from .inv other frames in the group of pi ctures: 

predictivel'v decoding the P-blocks from the second trame 
of the PB-frame pair based on the I-blocks in the 
previous I-frame or the P-blocks in the previous 
PB-trame pair: 
bi-directionally predictively decoding the B-Wocta » trom 
the first frame of the PB-frame pair based on the 
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I-blocks in the previous I- frame or the P-blocks in the 

previous PB-frame pair and the corresponding P-block 

in the current PB-frame pair, 
deriving a scaled forward motion vector and a scaled 

backward motion vector for the B-block by scaling the 5 

motion vector of the corresponding P-block in the 

current PB-frame pair; 
obtaining a final forward motion vector for the B-block by 

adding a delta motion vector to the scaled forward 

motion vector, and 10 
obtaining a final backward motion vector for the B-block 

by subtracting the delta motion vector from the scaled 

backward motion vector. 

6. A method for decoding a sequence of video image 
frames according to claim 5. further comprising the step of 13 
forming, a decoded output, wherein the decoded output is 
responsive to a bitstream comprising: 

temporal reference information for the first and second 

frames of the PB-frame pairs; 20 
motion vector information for the P-blocks: 
quantized residual error information for the P-blocks: 
the delta motion vector information for the B-biocks: and 
quantized residual error information for the B-blocks, 

7. A method for decoding a sequence of video image 15 
frames according to claim 6. wherein 

the bitstream contains additional information to indicate 

the presence of at least one of: 
the delta motion vector information for the B-b locks: and ^ 
the quantized residual error information for the B-biocks. 

8. A method of decoding a sequence of video image •/ 
frames according to claim 5. wherein 

the scaling is based oq a temporal reference of the first and 
second frames of the PB-frame pair. 33 

9. An apparatus for encoding a sequence of video image 
frames comprising: 

means for encoding each frame in a sequence of video 
image frames into a set of group of pictures, each group 
of pictures comprising an I-frame foUowed by a plu- 
rality of PB-frame pairs; 

means for dividing the I-frame and the PB-frame pair into 
a plurality of spatially aon-overiapping blocks of pixel 
data: 

means for encoding and decoding the I-blocks of the 
I-frame independently from any other frames in the 
group of pictures: 

means for storing the decoded I-blocks to prcdictively 
encode subsequent frames: 5C 

means for prcdictively encoding and decoding the 
P-blocks of the secoad frame of the PB-frame pair 
based on the I-blocks in the previous I-frame or the 
P-blocks in the previous PB-frame pair: 

means for storing the decoded P-blocks to predictively 35 
encode subsequent frames: 

means for deriving a scaled forward raotioa vector and a 
scaled backward motion vector for a B-block by scaling 
the raotioa vector of the corresponding P-block in the 
current PB-frame pair, the B-block being the first frame ^ 
of the PB-frame pair: 

means for obtaining a final forward motion vector for the 
B-block by adding a delta raotioa vector to the scaled 
forward raotioa vector: 

means for obtaining 3 final backward motion vector for 65 
the B-block by subtracting the same delta motion 
vector from the scaled backward motion vector: and 
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means for encoding the B-blocks of the first frame of the 
PB- frame pairs based on the I- blocks in the previous 
I- frame or the P-biocks in the previous PB-frame pair 
and the corresponding P-block in the current PB-frame 
5 pair using the final forward motion vector and the final 
backward motion vector. 
10. An apparatus for decoding a sequence of video image 
frames comprising: 
means for decoding each frame in a sequence of video 
10 image frames into a set of group of pictures, each group 
of pictures comprising an I- frame followed by a plu- 
rality of PB-frame pairs: 
means for decoding the I- blocks of the I-frame indepen- 
t5 dentiy of any other frames in the group of pictures: 
means for storing the decoded I-blocks to predictiveiy 

decode subsequent frames: 
. means for decoding the P-blocks of the second frame of 
the PB-frame pair based on the I-blocks in the previous 
20 I-frame or the P-biocks in the previous PB-frame pair, 
means for storing the decoded P-biocks to predictiveiy 

decode subsequent frames: 
means for deriving a scaled forward motion vector'and a 
scaled backward motion vector for a B-biock by scaling 
25 ' the motion vector of the corresponding P-block in the 
current PB-frame pair, the B-block being the first frame 
ygi of the PB-frame pain 

m means for obtaining a final forward motion vector for the 

B-biock by adding a delta motion vector to the scaled 
™ forward motion vector; 

^ means for obtaining a final backward motion vector for 

83 the B-biock by subtracting the delta motion vector to 

fj| the scaled backward motion vector and 

%j 35 means for decoding the Brblocks of the first frame of the 

PB-frame pairs based on the I-blocks in the previous 
? a I-frame of the P-biocks in the previous PB-frame pair 

and the corresponding P-block in the current PB-frame 
Q pair using the final forward motioa vector and the final 

jy. 40 backward motion vector. 

s «I 11. A method for encoding a sequence of video image 

^ frames comprising the steps of: 

y dividing a source sequence into a plurality of groups of 

Lf pictures, each group of pictures comprising a fir* 

^ 45 frame (I-frame) followed by a plurality of pairs of 

predictiveiy encoded frames (PB-frame pairs): 
dividing each [-frame or PB-frame pair into a plurality of 
blocks: 

50 encoding the blocks from the I-frame: 

predictiveiy encoding the blocks from the second frame of 

the PB-frame pair: 
bi-directionally predictiveiy encoding the blocks from the 

first frame of a PB-frame pair (B-blocks j: 
55 deriving a scaled forward motion vector and a scaled 

backward motion vector for the B-block: 
obtaining a final forward motion vector for the B-block by 
' adding a delta motion vector to the scaled forward 

motion vector: and 

60 obtaining a final backward motion vector for the B-block 

by subtracting the delta motion vector from the scaled 
backward motion vector. 
[2. An apparatus for encoding a sequence of video image 

61 frames comprising: 

means for dividing a source sequence into a pluraliry of 
groups of picruxes. each group of pictures comprising a 



first frame (I- frame) followed by a piuralit^Tpairs of 
predictivery encoded frames (PB-frame pairs): 
means for dividing each I-frarae or PB-frame pair into a 
plurality of blocks; 

means for encoding me blocks from the I- frame: 
means for predictiveiy encoding the blocks from the 

second frame of the PB-frame pair; 
me ans for bi-directionally predictiveiy encoding the 

blocks from the first frame of a PB-frame pair 

(B-blocks); 
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means for deriving a scaled forward rnotioa vo^^^ni a 

scaled backward motion vector for the B-bll 
means for obtaining a final forward mociOQ vector for the 

B-block by adding a delta motion vector co the scaled 

forvard morion vector, and 
means for obtaining a anal backward motion vector for 

the B-blcckby subtracting che delta motioa vector Srom 

the scaled backward rnotioa vector. 



